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Abstract 

The problem of parameter estimation by the continuous time obser¬ 
vations of a deterministic signal in white gaussian noise is considered. 

The asymptotic properties of the maximul likelihood estimator are de¬ 
scribed in the asymptotics of small noise (large siglal-to-noise ratio). 

We are interested by the situation when there is a misspecification in 
the regularity conditions. In particular it is supposed that the statis¬ 
tician uses a discontinuous (change-point type) model of signal, when 
the true signal is continuously differentiable function of the unknown 
parameter. 
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1 Introduction 


Consider the problem of parameter estimation by the observations of the 
signals in White Gaussian Noise (WGN) 

dXt = S(d,t)dt + edWt, Xq = 0, 0 < f < T. 
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Here S (i^, t) is deterministic known signal and we have to estimate the param¬ 
eter -d G 0 = (a, /3) by continuous time observations X'^ = {Xf, 0 < t < T). 
We are interested by the asymptotic behavior of estimators of this parameter 
in the asymptotics of small noise, i.e.; as £ —)■ 0. It is known that if the signal 
S (i9, ■) is a smooth function of 'd with hnite Fisher information 



( 1 ) 


then the maximim likelihood estimator -dg is consistent, asymptotically nor¬ 
mal with the rate of convergence e and asymptotically efficient 0, 0, CS!. 
Here in the sequel dot means derivation w.r.t. d. 

The situation changes if the signal S ("d, t) = S {t — 'd), where S (t) is a 
discontinuous function of t, say, has a jump at the point f = 0. Then the 
Fisher information does not exist and the properties of the estimators are 
essentially different. For example, the MLE is consistent, has non gaussian 
limit distribution with the rate of convergence and asymptotically efficient 
are bayesian estimators [B]. 

Let us recall that there is always a gap between mathematical model de¬ 
scribing the results of observations and the model which corresponds exactly 
to these observations. Sometimes the difference is not important and the 
theoretical results are in good agreement with the real data and sometimes 
the difference can be essential. “All models are wrong, but some are useful” 
(G.E.P. Box). 

We are interested by the situations, where the choosen models are not 
indeed useful, i.e.; there is a misspecihcation. This misspecihcation con¬ 
cerns not only the choice of the signal in the family of models close to the 
model of real data, but we suppose that even the regularity conditions as¬ 
sumed by the statistician are wrong. In particularly, the observed signal 
S ('d, t) is smooth with respect to the unknown parameter ■&, but the signal 
choosen by the statistician M (-d, t) is discontinuous. Our goal is to describe 
the properties of the corresponding pseudo-MLE. We remind the well-known 
property of this estimator that it converges to the value d, which minimizes 
the Kullback-Leibler distance. Then we study its limit distribution and show 
that it converges to non gaussian limit law with the rate Remind that 
the real signals in radiophysics can not have exactly rectangular form due to 
well-known physical law and each time when the change-point type signal is 
used to describe the real signal we are in situation of approximation [B]. This 
approximation can be good or bad depending on the front of the signal and 
the level of signal-to-noise ratio. 


2 


We consider as well in some sense inverse problem, where the theoretical 
model is smooth and the real data model is discontinnous and we describe 
the asymptotics of the psendo-MLE as e —)■ 0. We show that in this case 
the estimator de converges to the point "d which minizes the Knlback-Leibler 
distance and is asymptotically normal with the rate e. 

At the end we describe the conditions on the misspecified model (discon- 
tinuons vs discontinuons) which allow neveretheless to prove the consistency 
(trne) of the psendo-MLE. 


2 Auxiliary results 

Let ns consider the problem of parameter estimation by the observations 
(in continnous time) of the deterministic signal in the presence of White 
Gaussian Noise (WGN) of small intensity 

dXt = S{^,t)dt + edWt, Xq = 0, 0 < f < T, (2) 

where the unknown parametr -d G 0 = (a, /?). We suppose for the simplicity 
of exposition that this parameter is one-dimensional and that a and (3 are 
finite. We are interested by the behavior of the estimators of this parameter 
in the asymptotics of small noise, i.e., as e ^ 0. It is well known that in the 
case of the smooth (w.r.t. "d signal S (d, t) the maximum likelihood estimator 
(MLE) de and bayesian estimator (BE) d^ are consistent, asymptotically 
normal 

^-1 (^4 _ ^ w (0,1 (d)-'), £-1 (d, - d) ^ W (0,1 (d)-'), 

we have the convergence of all polynomial moments: for any p > 0 

= i(d)“2Eicr 

and the both estimators are asymptotically efficient. Here I (d) is the Fisher 
information ([T]). 

Suppose that the signal S (d, t) has cusp-type singularity, say, S (d, t) = 
|f — 6*1'', where 0<Q;<d</d<T and k G (0, i). Then the Fisher in¬ 
formation is oo and we have a singular problem of parameter estimation. 
Introduce the Hurst parameter H = k ^ and double-side fractional Brow¬ 
nian motion (fBm) {u) ,u & R. It can be shown that the MLE and BE 


lim 

£—^0 


df — d 


= I(d)-=E|C| 


lim E ,9 

£—^0 
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are consistyent, have limit distributions 


£ » - i9j e H 

the polynomial moments converge : for any p > 0 

p 




lim 

£—^0 


■dp — -d 


_ 2 _ 

EH 


= E 


lim E^ 

£—^0 


■dp — -d 


_ 2 _ 

EH 


= E 


and the BE are asymptotically efficient. Here the random variables ^ and ^ 
are defined by the relations 



sup Z (u), 

ueR 


i 


f uZ (m) du 
f Z (u) du 


and the process 

Z (u) = exp (u) - y , 


u e R, 


where 7 ^ is some constant. The proofs can be carried out using the general 
methods developped in [7] as it was done in |1] in similar problem. 

Suppose now that the function S t) has discontinuity, say, 


S = h{t) + 9 (f) 0 < a <d < ^ <T, 

where h{t) ^ g (t) for t G (a, (3). 

It can be shown that the MLE and BE are consistent, have limit distri¬ 
butions 


e ^ (-dp — -d 


e ^ (-dp — -d 


^ V, 

the polynomial moments converge : for any p > 0 

p 


lim Eij 

£—^0 


-dp — -d 




= E 


lim E ,9 

£—^0 


■dp — -d 


= E|7|^ 


and the BE are asymptotically efficient. Here the random variables 17 and rj 
are defined by the relations 

. s X - f uZ (u) du 

Z {f}) = sup Z (u), 7 = P 

u&R J Z [u) du 
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and the process 


Z (n) = exp I (-d) ly (n) — ^ 1^1 j; u E R, 

where 5 {d) = h {d) — g{d). 

We are intrerested by the following problem of misspecihcation. Suppose 
that model of observations the choosen by the statistician is 

dW = M(^?,f)df+ £dWi, Wo = 0, 0<f<T 

and the real data model (Ej) are different. Especially we study the situations 
where the regularity conditions of these models do not coincide. For example, 
the signal S ('d, t) is a smooth function of d (regular case), but the statistician 
supposes that the observed model has singularities of cusp or discontinuous 
types. 

We discuss the conditions of the consistency, the rates of convergence and 
the limit distributions of the corresponding pseudo-MLE in such situations. 


3 Main results 

We suppose that the observed process [real model) is 

dXt = S{^o,t)dt + edWt, Xq = 0 , 0 < f < T, (3) 

where do is the true value of unknown parameter and S (d, •) G L 2 (0, T). If 
we use the theoretical model 

dXt = M['dR)dt + edWt, Xq = 0, 0 < f < T, 

with M (d, •) G L 2 (0, T), then the likelihood ratio (misspecihed) is 

E(d,X^) =exp|l^ M(d,t)dW-^^ M(d,f)'df|, dG0 (4) 

and the (pseudo) MLE d^ is dehned by the equation 

I/(d„X^) =supy(d,X^). (5) 

^ ^ i?ee 
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To understand what is the limit of the MLE we write the likelihood ratio as 
follows 

In y X^) 

= e [ M{^,t)dWt-l [ [M (d, tf - 2M (d, t) S (4, t)] dt 
Jo 2 Jq 

= 1 j M {{), t) dH-', - i ||M (d, ■) - S (d„, - jit + ||S (d„. - jf 

where we denoted as H-H the L 2 (0,T) norm. It can be easily verihed that 
under mild regularity conditions we have the convergence 

sup £2 In L (d, X^) -hlM(d,-)-S (Jo, •) ||' + 11^ (^ 0 , •) ll' ^ 0. 

i9G0 ^ 

Hence if we suppose that the equation 

inf||M(^?,-)-^(4,-)|| = M(J,-)-S(Jo,-) 

has a unique solution J, then we obtain the well-known result that in the case 
of misspecification the MLE converges to the value J, which minimizes the 
Kullback-Leibler distance. It is interesting to note that in general case J ^ Jq 
but sometimes J = Jq and we consider the conditions of the consistency in 
such situations. The most interesting for us is the question of the rate of 
convergence of the MLE to the true value. 

3.1 Discontinuous versus smooth 

Here we consider the situation where the true model (described by S (J,-) is 
smooth w.r.t. J but the theoretical model choosen by statistician is discon¬ 
tinuous. We start with a simple example. This example allows to see how 
we can have different rates of convergence of estimators and what is the limit 
distribution of these estimators. 

Example 1. Suppose that the observed process is 

dXt = (t — Jo) dt -l- sdWt, Xo = 0, 0 < t < T, (6) 

where Jo E Q = (a, /3), 0 < a < 13 < T and the theoretical model is 

dXt = sgn (t -J)dt + edWt, Xq = 0, 0 < t < T, JeQ. (7) 
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The pseudo-likelihood ratio is the function 


V (d, X'^) =exp<— sgn (f - h*) dXt - — L h* e 0 


because sgn (t — = 1. Note that 


■d = arg inf / [sgn (f — d) — (f — "do)]^ df = do- 
i?ee n 


Hence the MLE dg defined by the relation 


de = arg sup / sgn (t — d) dXt 
i9e0 Jo 

in this misspecified parameter estimation problem is consistent. 

To study its rate of convergence and the limit distribution we introduce a 
normalized likelihood ratio 

H(do,X^)’ 

where d^ = do -|- (peU. Here —)■ 0 will be choosen later and 


U.= 


a - dp /j - dc 

Pe ’ Pe 


-CXD, CX)) 


as £ —)■ 0. 


The substitution of the observation process in the likelihood ratio yields 
us the following expression (we suppose that u > 0) 


In {u) = - / [sgn (t - dp - Peu) - sgn {t - dp)] dlTt 

^ Jo 

+ \ [ [sgn (f - dp - peu) - sgn {t - dp)] {t - dp) dt 


2 / ^{^o<t<'&o+‘Peu} dp) dt 

£ ./n 


[hld9o+‘Pe« 


2 t-'dQ+ipeU 


(t - dp) dt 


hlPlw (u) - 

£ £^ £ 


3/2 2 ' 
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where we denoted the Wiener process 


W'+ (u) = %, 


u e 


0 , 


(3-^0 


u e 


0 , 


(3 -'do 


Therefore if we take then we can write 

s 2/3 r -S'! 

(m) = (w)) 2 = exp |W+(m) - y| , 

For the negative u we obtain a similar representation 

e2/3 r -2 

Z, («) = (Z, (^x))— = exp W_ (-u) - - 


where W- (u) ,u > 0 is a Wiener process independent of W+ (u) ,u > 0. 
Hence if we denote VF (■) a double-sided Wiener process, then the pseudo- 
likelihood ratio is 


a — dn 

u e I -^,0 


(m) = {Zs {u))~ = exp <1 VF (m) - — f , 


u e Ue. 


Now the properties of the MLE follow from the relations 


P^o I <x \ = P, 


£2/3 


P, 9 q sup V (id, > sup V (^d, 


= P 


V{^,X^) V{d,X^) 

^0 <1 sup ,, , q > sup ,, . „ yy. 

^<Ci)o+s‘^/3a; V (-do, x^) ,?>^0+e2/33. V (do, ) 



sup 

Zs(u) 

> 

sup 

Zs (u 

1 

u<x,u£Ve 



u>x,uGUe 


Pi?o j 

f 

, sup 

Zs(u) 

> 

sup 

Ze (U 

1 

u<x,uG'Ue 



u>x,uGVe 



where we denoted the solution of the following equation 
W{u,)-^ = snp{w{u)-- 

^ ueVe \ ^ 


It can be shown (see below) that 

P^o (its < x) 


P,?o (u < x) 



i.e.; 


u 

tip. u = arg sup ( ly (ti) —— 

u£R V 2 


Therefore we obtain the following 

Proposition 1 The pseudo-MLE t?^ in this problem is consistent, converges 
in distribution 


- A 

^2/3 


U 


and the moments converge: for any p > 0 


lim 


de - do 


^2/3 


= E\uf. 


The proof follows from more general result of the Theorem [T] below. 
Remark 1. Choosing different smooth signals in the class 

iS = (f — d) = sgn {t — d)\t — d\'^ 

and the same theoretical model o we can obtain different rates of conver¬ 
gence of estimators. Indeed, let us fix some k G (|,oo). Then the corre¬ 
sponding calculations like (jS]) provides us the expression 


In Zg (ti) = 


2 \/^ 


iy+(tt) 


Te 




u 




£ (1 -h it) 


Therefore if we put then 


Ze (ti) = exp < fy (ti) — 


ti 


1 + K 


1 -1- it 


ti G Up 


and the pseudo-MLE d^ is consistent and satisfies the relations 


de-do 

- 2 — = tip = arg sup 

£2k+1 neUe 


W{u) - 


ti 




1 -(- it 


ti, 
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where 


u = arg sup 

uGR 


W{u) 


\u\ 


I!+«: 


1 + K 


Therefore choosing different k > ^ we can obtain any rate £'^,7 < 1 of 
convergence of pseudo-MLE ; 


- 4 

e'y 


u. 


Return now to the general smooth model of observations 

dXt = S{'do,t)dt + edWt, Xq = 0, 0<t<T (10) 

and the discontinuous theoretical model 

dXt = M{^,t)dt + edWt, Xq = 0, 0 < t < T, (11) 

where the signal 


M (d, t) = h it) Ip<^} + g it) 


The unknown parameter d G 0 = {a, f3) with 0 < a < /3 < T. We observe a 
trajectory X^ = (X^, 0 < f < T) of the solution of the equation ([TU]) and we 
want to estimate "do supposing that the observed process is dll. Therefore 
we introduce the pseudo-likelihood ratio 


V (^, X^) =exp<— h (t) dXt + — 9 (t) dXt 


~7ro / d g 0 


2e2 


2^2 


and define the pseudo-MLE dg by the equation 

V U„X^) =supy (d,X^) 
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Let us introduce the following notations: 


S{t) = h (t) - git), 


^it))= [Mit),t)-Sit)o,t)fdt, 

Jo 

^{^) = 2 [h {iJ) - S (4, \h i^) - S' (4, 


7(d) = 


d e 0, 


Z (u) = exp <j 5i^)W (u) - 


u e R, 


Z° (v) = exp ■{w{v) — ^ , V e R 


u = argsup 

5(d) W(u) 

, V = argsup 

r vn 

w{v)- — 

ueR 

2 

u&R 

2 


Here dot means differentiating w.r.t. d, prime means differentiating w.r.t. t, 
W (u) ,u E R and w (v) ,v E R are double-sided Wiener processes. 


Note that 


u = V 


m 

7(d) ^ 


( 12 ) 


Indeed, let us put u = rv. Then we can write 


5id)Wiu) 




-u^ = \/rd('iJ)w (v) 


= ^/f5(d) 


w (v) 


7(d)r2 

2d(d) 


= v^S(d) 


w (v) - 

^ ^ 2 


if we put r = d(d)3 7 (d) s. Here w (v) = r (rv). This proves (IT^ . 

Conditions M.. 

1 . inf tee 5(f) > 0. 

2. The equation 


[h (t) - S (do, t)] dt+ [g (t) - S (do, t)] dt = inf $ (d) 

7.0 1?G0 
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has a unique solution = "& (tto) E 0. 

S. The functions h{t) ,g {t) and S {d, t) are continuously differentiable 
w.r.t. t E Q. 

4 . inf^ee $ fd) > 0, 

Note that as d is the point of minimum of the function <h fd) we have the 
equality 



12 


h{d)-S{do,d) - g{d)-S{do,d) 


1 2 


= 0 , 


(13) 


which is equivalent to 


(h{i) - 9(tf)) 




0 . 


Hence the point d satishes to the equality 

Of course, this is a necessary condition only. The equation 

S(i?o,«) = hh±?H, a<t<p (14) 

can have many solutions corresponding to the local maximums and minimums 
of the function $ (f), t G 0. If the equation ([TTl) has no solution, say, 

S{do, t) < 2 a<t < (15) 


then d = a. Otherwise d = (5. In these two cases the behavior of the 
estimator d^ can be studied as it was done in |2], Section 2.8. If we have the 
equality 


S{do,t) 


h{t)+9{t) 

2 


a < t < 


then any point of the interval {a, /3) can be taken as d. We do not study here 
the properties of d^ in this situation and in the situation when the function 
<h ("d) ,a < d < (5 has two or more points of minimum. Note that such study 
can be done by the same way as in |S], Section 2.7. 

The properties of the pseudo-MLE d^ are described in the following the¬ 
orem. 
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Theorem 1 Let the conditions Ai be fulfilled then the estimator converges 
to the value d, has the limit distribution 


and for any p > 0 


tie — 'd 


lim 


de — d 
-^TfT 


E-j^o I h 


p 


(16) 


(17) 


Proof. As before we study the normalized pseudo-likelihood ratio process 

V (l9 + (peU,X^'' 


^£ (m) = 


V 


U e Ue, 


where pe = ■ We have (u > 0) 


r-T , 


lnZ,(u) = - / 
t Jo 


M\^^ + PeU,t) -M{t},t 

TV, .2 


dX 


2^2 


'0 


M I'd + (peU-,t] — M ['d,t 


dt 


i^'d+tpsU 


[h{t)-g{t)]dWt 


1 A+ipeU 

— {[h (t) - S (do, t)f - [g (t) - S (do, t)]^) dt 






\ftpl 


<h (d + ) — <h (d 


2^2 




^h(d)W+ (u) - ^<h(d) + o (1) 


= £-2/3 


h(d)W+ (u) - 


<h(d) ^2 


2 2 


+ 0 ( 1 ). 


Here we introduced the Wiener process 


W+ (u) = - ^ -, u e 

y/tpe 


0 , 


/3-d 

p2/3 
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and used in the expansion of <h ('d + cpeU ) the equality 


For the negative values u < 0 we obtain the similar representation 


In Zg (u) = e 


- C--2/3 


S{d)W- i-u) 


2 y 


+ 0 ( 1 ) 


with independent Wiener process W_ (n) > 0. 

Introduce the random process 

Ze (u) = {Ze {u)Y = exp <1 6{'d)w (u) - + 0 (1) , u eVe 


We define Z^ [u) lineary decresing to zero on the interval 


£ 2/3 ’ ^ 2/3 


+ 1 


and increasing from zero to Z^ fhe interval 


a — ^ _ a — i} 


£2/3 


1 , 


£2/3 


Further we put Z^ {u) = 0 for 


u Y 


a — d ^ 13 — d 

1 ,^ 77 ^ + 1 


£ 2/3 


£2/3 


Now the process Z^ [u) is dehned for all u E R. Note that this process is 
continuous with probability 1. 

Let us denote by the measure induced by this process in the space 
Co {R) of continuous functions decreasing to zero at inhnity. The correspond¬ 
ing measurable space we denote as (Cq (R) , B), where B is Borelian cr-algebra. 
By Q we denote the measure of the limit process Z (■). 

From this representation we obtain immediately the hrst lemma. 

Lemma 1 IFe have the convergence of finite-dimensional distributions of 
Ze (•)•' for any set Ui,... ,Uk and any k = 1,2,... 

(ui) ,..., y (wfc)) ^ (^z (ui) ,...,z (wfc) j . ( 18 ) 

This convergence is uniform in d on compacts IK C 0. 
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We need the following elementary estimate 
Lemma 2 There exists a constant k > 0 such that 


$ (^ 9 ) - 


( 19 ) 


Proof. As the point d is a nnique minimum of the function $ ('d), we can 
write for any u > 0 

m (z/) = inf <h (d) — $(79) > 0. 


Hence for 


z9 — ?9 


> u 


(?9-d) 

$ {'&) — <h(?9) > m{h') > m (u) — - - 

iP-a) 


Further, for the values 


?9 — d 


< by Taylor expansion we have 


(^9) - <F(?9) = ^<F(d) (^z9 - (1 + o (1)). 

Therefore for sufficiently small z/ we can write 

$ (?9) - <F(z9) > ^<h(z9) . 

Taking 


K = min 


iP-af 4: ^ ^ 


we obtain ([T9]). 

This estimate allows us to verify the boundness of all moments of the 
pseudo likelihood ratio process. 

Lemma 3 For any p > 0 there exist constants c > 0 and d > 0 such that 
for all |n| > d 

E^„4^(n)<e-“^ (20) 
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Proof. Indeed, we have 


(uf = exp 


p- 


2^-2/3 p-S+iPsi 


S (t)^ dt — pe <I)('d + ^eu) — <I’('d) 


Now the estimate (120]) follows from the relations 

r-i?+ipeU 


,- 2/3 


,- 4/3 


6 (t) dt < sup 6 (t) 1^1, 
tee 


^>0 + PeU) - d>{d) 


> KU^, 


where we used (HI]). Therefore we obtain the estimate (120]) with some c > 0 
and d > 0. 


Lemma 4 For luil < A^, 1^21 < N and any N > d we have the estimate 


E 




{U2) - (ui) < C (1 + |«2 - «i| 


( 21 ) 


with some constant C > 0. 
Proof. Let us denote 


at 


= e bt = -e {t)[h{t) + g {t) - 2S {do^t)], 


G (t) = exp 
Note that 


a^dlTs + 


bAs 


I'd+ipeUi 


'tt+ipeUi 


G [id + p^U2 ) = 


Ze {U2) 
Ze (Mi) 


The process G [t) has stochastic differential 

. 2 " 


dG (t) = G [t) 
Therefore 


bt F dt F G (t) atdWt, (7 (?/ + p^ui ) — 1. 


G I d + P£U2 ) — 1 + 


''■d+ipeU2 


G{t) 


'■d+ipeUi 


btF^ 


r’S + ipi;U2 


dt + 


G(t) atdWf 


I'd+ipi.ui 
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We write 


E, 


1?0 


(U 2 ) - (til) = (uiy 

l‘i)+ipeU2 


G ( I? + ipeU2 ) - 1 


E^o-^e (^ 1 ) 

< C^Ei)X («i)' 


G{t) 


•d+^p^ui 

/•'§+ipsU2 

'S+tpeUi 


bt + 


dt + 


(•l}+tpeU2 


G (t) atdWt 


l-d+ipeUi 


Git) 


at 


bt + ^ 


dt 


C^E^X iX 


f>1?+V3e'U2 


Git) atdWt 


< Cl iu2 - Ui)^ ifl 


f>i)+ipeU2 


EXeiulfGitf 


'•d+ipeUl 


bt + 


/ ^ 8 

p'd+ipeU2 

1 E^o-^£ ("^i) E^o 

G it) atdWt 

V 

Ji)+(peUi 


dt 

1/2 


For stochastic integral we have the estimate 


E 


1?0 


(•l}+tpeU2 


Git) atdWt 


/ 'H+ipeUi 


f1?+99etJ2 


< CE^g 


G (t)^ afdt 


I '^f+ipeUi 


< iu2 - Ui)^ip 


3 ,„3 


f'0+ipeU2 


'■d+ipeUi 


a®E^„G(t)® dt. 


Further 


E^qG it) = exp < 32 


■ “ 
I'd+ipeUi 

p'd+ipeUl 


2 ds - 8£-^/3 


$ (t) — $ ( d + (peUi 


'0 


ds — 8e (d + ^sUiJ 


Ett^Z^iui) = exp <( 32 
Hence 

E^o Zg (ua) - iui) <C\u2-UiXC \u2 - Uif 

<C{1 + N^) \u 2 -u 1 f 

for |mi| < N and \u 2 \ < N. 

Now the properties flT^ and ffT7|) of the pseudo MLE de follow from the 
Lemmae [H [3l m and the Theorem 1.10.1 in [7]. 
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We see that the has a “bad” rate of convergence. Note that for other 
estimators the rate can be better. 

Let us study the trajectory fitting estimator ij* dehned by the relation 


ij* = arg inf / [Xt — m {tt, t)Y dt, 

Jo 

where 

m{'&,t) = [ M {d, s) ds. 

Jo 

Suppose that the function 

fT 

dt (d) = / [rni^dfi) — s {'do,t)Y dt, "d G 0 

Jo 

has a unique minimum at the point d* G 0. Here 

■s (do,t) = j S (do,n) dn. 

Jo 

This estimator admits the representation 

d* - d* fJ Wt m (d*, t) dt 


fY m (d*, tY dt 


(1 + 0 ( 1 )). 


Therefore this estimator is asymptotically normal with the rate £. The details 
of the proof can be found in the Section 7.4 in [ 8 ], . 


3.2 Smooth versus discontinuous 

Suppose now that the true model flTOl) has discontinuous trend coefficient 
S (do, t) of the following form 

dW= [h{t) 9 Y) fI{i>i?o}] dt + sdWt, Xq — 0, 0 < t < T, (22) 

where do G 0 = (a, /?), 0 < a < /5 < T, but the statistician uses the model 

dXt = M (d, t) dt + edWt, Xo = 0, 0 < t < T, (23) 

with the “smooth” signal M(d, •). The likelihood ratio L (d, and the 
pseudo-MLE dg are dehned by the same relations (jl]), ([5]). As before, we are 
interested by the asymptotic behavior of d^ as £ —)■ 0. 
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To show that the situation is quite different we start with the example 
which is “symmetric” to the Example 1. 

Example 2. Suppose that the observed process is 

dXt = sgn {t — do) dt + edWt, Xq = 0, 0 < t < T, 

and we use the model 


dXt = (t — d) dt + edWt, Xq = 0 , 0 < t < T, 

to estimate the parameter d G 0 = (a, /9), where d < a < (3 <T. 
It is easy to see that the function 

<h (d) = /” [t — d — sgn (t — do)]^ dt, d G 0 


atteints its minimum at the point 


T 2 

2 ' y u 


and therefore the pseudo-MLE 

- 2X 


d. = 


2T 


T 5 Wt _^ ^ 

= d-— e —)■ d. 


T 


It has Gaussian distribution 





~ Ar(0,T-^) 


and the rate of convergence is e. 

Note that if the true value is do = then it is “consistent” otherwise - 
not. Of course the consistent estimator can be constructed as follows 


d 


* 

e 



T 

'2 


^0 


but to do this we need to know the true model. 

Let us return to the problem with the equations fl22l) . fl23l) and introduce 
the conditions of regularity. 

Conditions TZ. 
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1. The functions h (■) and g (■) are bounded and for all do G [a, fd] we have 
h{do)^g{do). 

2. The function 


^{d)= [M{d,t)-S{do,t)fdt, dee 

Jo 

has a unique minimum at the point dee. 

3. The function M {d,t) is two times continuously differentiable w.r.t. d. 
4- The function 

^{d) = 2 [ M{d,t)\M{d,t) - S{do,t)]dt+ [ M{d,tYdt>0. 
Jo ^ ^ Jo 

Let us denote 


I(^?)= / M{d,tfdt, 
Jo 


(do) = ^{d)-%d). 


Theorem 2 Let the conditions TZ be fulfilled, then the estimator d^ converges 
to the value d, is asymptotically normal 


dr — d 


i^Af{0,B{dof), 


and for any p > 0 


lim E 

£—^0 




dr — d 

p 



= 


e 




Proof. Using the Taylor expansion we can write for the likelihood ratio 


V(d + eu,X j la-dfd-d 

Ze (u) = -7--N-, uel]e = 


V d,XT 


e e 


the presentation 


\nZriu)=u I M[d,t] dWt - —^(d) + o (1). 




Therefore, if we denote 


u 


Z{u)=exp{u I M{d,t)dWt- y u e R, 


then we obtain the hrst lemma. 
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Lemma 5 We have the convergence of finite-dimensional distributions of 
(•)•' Ui, ■ ■ ■ ,Uk and any k = 1,2,... 

{Ze (ui) {Uk)) ^ {Z (mi) ,...,Z {Uk)) ■ (24) 

This eonvergence is uniforme in id on eompacts IK C 0. 

The next lemma can be proved following the same arguments as the Lemma 

El 

Lemma 6 There exists a constant k > 0 such that 

(25) 


Note that now the moments of (u) are no more bounded. Denote 
= 'd -h £u, then for any 7 > 0 , we can write 


{uf 

= exp 


r-T , 


7 

7^ 7« \ „.2 


M(^9„) - M{d) 


dt 


1_ 
2£2 L 


- <h(79) 


<exp<'(^M-^) u^\ = l. 


(26) 


where we denoted 


M = sup / Mfdfdt 
i9g0 Jo 


and put 7 = M 

Therefore we introduce the following normalized likelihood ratio 
4 (n) = Z, {uf , n e U, 

and establish the properties of this process similar to Z^ (•) in Lemmae[3]and 

il 

Lemma 7 Suppose that the conditions TZ are fulfilled, then we have the es¬ 
timates 


(u) < e- 


E 


»?0 


{U 2 ) - zl^^ (m) <0(1 + N^) {U2 - uiY 


for \ui\ < A^, 1^21 < N 


(27) 

(28) 
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Proof. The first estimate fl27|) we obtain immediately from fl2^ . The proof 
of the second estimate (125]) can be carried ont like the proof of the relation 

The properties of the process Zg (•) established in the Lemmae 5 and 7 
allows to cite Theorem 1.10.1 in [7] and to obtain the annonnced in the 
Theorem [2] properties of the pseudo-MLE 


3.3 Discontinuous versus discontinuous 


Let us remind that if the the observed model is discontinuous and the statis¬ 
tician knows this but takes the wrong signals before and after the jump, then 
nevereless it is possible to have the consistent estimation. Consider the fol¬ 
lowing problem of parameter estimation in the situation of misspecification. 
The theoretical model is 


dXt = [h {t) g {t) Ip>^}] dt edWt, Xq = 0, 0 < t < T, 

where d G 0 = (a,/3), 0 < a < /3 < T. Suppose that h{t) — g it) >0 for 
t G [a,/d]. The observed stochastic process has a different equation 

dXt = \\h it) + q (t)] -f [g {t) + r (t)] llp>^o}] dt + edWt, 0 <t <T, 

where q {t) and r (t) are some unknown functions. 

We study the conditions on q (t) and r (t) which allow the consistent esti¬ 
mation of the parameter do. 


The function $ (d) for d < do is 


(|)(d)= / q{tYdt+ / [h (t) + q (t) - g {t)f dt + / r {tf dt. 

Jo 


Hence 


d$(d) 

dd 


If the function 


then for d < do 


= q (d)^ — [h (d) — g {■&) + q (d)]^ 

= - (h (d) - ^ (d)) [h (d) - ^ (d) + 2q (d)] . 

^7 (d)-h(d) 


q (d) > 


d G 0, 


(29) 


d$(d) 

dd 


< 0 . 
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For 19 > under condition 


r (i9) < 


2 


we obtain the similar inequality 


(30) 


d$(d) 

di9 


> 0 . 


Therefore 


i9 = arg inf <F (i9) = i9n 

i?e0 

and we obtain the following result. 

Proposition 2 If the conditions fl29p and (150]) are fulfilled then the pseudo- 
MLE ds is consistent. 

It can be shown that 


'ds-'do _, ^ 

For the details see the similar problem in Section 5.3, j8]. The close problem 
of change-point detection formisspecihed diffusion processes are studied in 

m 

3.4 Discussion 

There are several other interesting problems of misspecihcation in regularity, 
which can be studied by the proposed here approach. 

One of them is to study the asymptotic behavior of the bayesian estimator 
ds in the situation described by the equations fl22D . (I23|) . The estimator is 

- d.) 

where p {i9) ,a<i9<fiis continuous positive density of the distribution of 
the random variable d. 
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It can be shown that di. converges to the same value d. Then using the 
notations of the section 3.1 we can write 


= 

^ f/3 






^ ^^ J^^up(djz,(u) dn 

lu, P (^n) Z, (u) du ’ 


where we changed the variables d = i9u = "d + Hence 


-d u Ze {u) dn fu, “ (“) 

— “XTw* =7j~ 


2 £- 2/3 


dn 


2 £- 2/3 


dn 


and the problem reduces to the study of the asymptotics of these two integrals 
in the situation, when 


Z^ (u) = exp { 6{'&)W (u) - ® (1)) • 


We can suppose that the detailed study will provide us the asymptotics 

- ~ 7 / 

£2/3 

where u is as before the point of the maximum of the process 5{d)W {u) — 
This means that as usual in regular estimation problems the asymp¬ 
totic behavior of the BE is similar to that of the MLE. 

Another problem we obtain if we suppose that the obsevred process has 
a signal M (d, •) with a singularity of the cusp-type (theoretical model) but 
the observed process in reality has a smooth signal S ("d, •), i.e.; cusp versus 
smooth. Say, 


dX^ = a |f — dl'^ df-|-edWt, Xq = 0, 0 < t < T, (31) 


where k, G (0, 

Therefore the observed process is ([TU]) but the statistician calculate the 
LR V (d, X^) and the pseudo-MLE dg following (0]) and ([5]) respectively. It 
is clear that dg converges to the value 

fT 

d = arghif / [a |f — dj'^ — S' (do, t)]^ df 
J0 
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which minimizes the Kullback-Leibner distance and we are interested by the 
limit distribution of £~3-2k Por the details see the forthcomming 

work [ 2 ]. 

The properties of the MLE and bayesian estimatrs for the ergodic diffusion 
processes and inhomogeneous Poisson processes with cusp-type singularities 
are studied for example, in 0, m- For the general theory of the parameter 
estimation for different singular estimation problems see [7]. 
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